Enhancing Cyber Security through the use of Synthetic Handwritten CAPTCHAs
نویسندگان
چکیده
Online services which allow users to contribute content and interact remotely over the internet in some manner are common today. Many of these services, like spam control for blogs and email account sign-up, require that they be accessed only by humans and not machines (automated scripts or bots). One method of differentiating between humans and bots is by using a CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart). A number of different genres of CAPTCHAs exist (text-based, visual, auditory, and cognitive). Text-based CAPTCHAs are popular because automatic recognition of degraded, noisy, distorted text with background clutter is still a challenging task for machines, but is a task that humans perform with relative ease. However, recently a significant number of printed-text based CAPTCHAs have been successfully attacked by bots, thereby rendering the services they protect vulnerable to attack. Thus there is an urgent need for exploring alternate CAPTCHAs and this serves as the prime motivation for our research. We have explored three primary tracks of investigation in this thesis. First, we have defined CAPTCHA design principles based on an exploit-avoid-resist paradigm. Second, we have improved the effectiveness of text-based CAPTCHAs by substituting printed text with handwritten text and then layering on additional cognitive tasks. We have developed a fullyautomated framework for synthetic handwriting generation for this purpose. Prior work in this area has focused on synthesizing handwritten textlines to conform to a particular user’s style. We have developed fully automated techniques for simulating non writer-specific handwriting by extracting principal curves from handwritten characters which serve as a set of control points to allow character-level distortion. We have used novel techniques for character baseline detection and ligature parameterization to construct the textlines. A parameterized sinusoid-based function is used to allow random perturbation of these textlines. We have generated CAPTCHAs of varying machine-difficulty levels and have shown them to be more effective than machineprinted CAPTCHAs. Finally, we have developed a new class of interaction-based CAPTCHAs, which require an entity to interact with the challenge in order to gain access to the solution space. Traditional textbased CAPTCHAs can be solved if an entity can successfully complete a recognition task. Traditional image-based CAPTCHAs can be solved if an entity can successfully complete the twin tasks of cognition and recognition. We show how the interaction-based CAPTCHA combines the best of both worlds and then adds a third layer of complexity by requiring an entity to successfully complete three tasks – interaction, cognition, and recognition – to be able to solve a CAPTCHA challenge. Specifically, we have developed a 3D shadow CAPTCHA which uses aspects of 3D scene rendering, ray casting, and perspective projection to present unique challenges to machines while remaining intuitive for humans to solve.
منابع مشابه
Generation and Performance Evaluation of Synthetic Handwritten CAPTCHAs
In this paper we explore the potential of handwriting for use in CAPTCHAs. A synthetic handwriting generation method is presented, where the generated textlines need to be as close as possible to human handwriting without being writer-specific. The primary application of such a synthetic generator is in the design of handwritten CAPTCHAs (Completely Automatic Public Turing Test to Tell Computer...
متن کاملSEIMCHA: a new semantic image CAPTCHA using geometric transformations
As protection of web applications are getting more and more important every day, CAPTCHAs are facing booming attention both by users and designers. Nowadays, it is well accepted that using visual concepts enhance security and usability of CAPTCHAs. There exist few major different ideas for designing image CAPTCHAs. Some methods apply a set of modifications such as rotations to the original imag...
متن کاملExploiting the Gap in Human and Machine Abilities in Handwriting Recognition for Web Security Applications
Automated recognition of unconstrained handwriting continues to be a challenging research task. In contrast to the traditional role of handwriting recognition in applications such as postal automation, bank check reading etc, in this dissertation we explore the use of handwriting recognition for cyber security. HIPs (Human Interactive Proofs) are automatic reverse Turing tests designed so that ...
متن کاملSynthetic handwritten CAPTCHAs
Article history: Received 9 August 2008 Received in revised form 30 November 2008 Accepted 19 December 2008
متن کاملCyber security using arabic captcha scheme
Bots are programs that crawl through the web site and make auto registrations. CAPTCHAs, using Latin script, are widely used to prevent automated bots from abusing online services on the World Wide Web. However, many of the existing English based CAPTCHAs have some inherent problems and cannot assure the security of these websites. This paper proposes a method that focuses on the use of Arabic ...
متن کامل